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Abstract 


Based  on  the  ripple  analysis  model  outlined  in  Part  I  of  this  paper,  predictions 
are  made  for  the  detection  of  shape  changes  in  spectral  peak  profiles.  Peak  shape 
is  uniquely  described  in  terms  of  two  parameters:  bandwidth  factor  (BWF)  which 
reflects  the  tuning  or  sharpness  of  a  peak,  and  a  symmetry  factor  (SF)  which  roughly 
measures  the  local  evenness  or  oddness  of  a  peak.  Using  profile  analysis  methods, 
thresholds  to  changes  in  these  parameters  (defined  as  6BWF/BWF  and  £SF)  are 
measured  together  with  the  effects  of  several  manipulations  such  as  using  different 
peak  levels,  varying  spectral  component  densities,  and  randomizing  the  frequencies 
of  the  peaks.  The  new  ripple  analysis  model  accounts  well  for  the  measured  thresh¬ 
olds.  Predictions  of  the  three  previously  published  models  for  the  same  profiles  are 
also  evaluated  and  discussed. 

INTRODUCTION 

The  shape  of  the  acoustic  spectrum  is  a  fundamental  cue  in  the  perception  and 
recognition  of  complex  sounds.  It  is,  however,  uncertain  how  this  spectrum  is  repre¬ 
sented  in  the  auditory  system  and  what  specific  features  are  extracted  and  empha¬ 
sized  by  such  a  representation.  In  Part  I  of  this  paper  [  Vranic-Sowers  and  Shamma, 
19xx],  it  is  proposed  that  the  auditory  system  transforms  the  spectral  profile  into  a 
weighted  sum  of  sinusoidal  spectra  -  called  ripples.  This  ripple  analysis  is  carried  out 
by  a  bank  of  filters  tuned  to  ripples  of  different  frequencies  and  phases.  Perception 
of  the  profile  is  then  based  on  the  ripple  transform  of  the  profile  and  not  on  the  pro¬ 
file  itself.  This  hypothesis  is  partly  supported  by  physiological  data  demonstrating 
that  the  primary  auditory  cortex  (AI)  is  indeed  capable  of  carrying  out  the  ripple 
analysis  implied  by  the  model  [Calhoun  and  Schreiner ,  1993;  Shamma,  Versnel  and 
Kowalski ,  1993]. 

To  test  this  hypothesis,  a  computational  model  was  developed  and  applied  suc¬ 
cessfully  to  the  prediction  of  perceptual  thresholds  of  several  well  known  profile  anal¬ 
ysis  experiments  [  Vranic-Sowers  and  Shamma ,  19xx].  However,  to  distinguish  this 
model  from  other  profile  analysis  models  (e.g.,  the  maximum  difference  [Bernstein 
and  Green ,  1987]  and  the  independent  channels  models  [Durlach,  Braida  and  Ito , 
1986]),  several  predictions  were  made  regarding  the  detection  of  complex  changes 
in  arbitrary  profiles  [Vranic-Sowers  and  Shamma ,  19xx].  The  most  important  is 
that  sensitivity  to  shifts  in  the  “magnitude”  and  “phase”  of  the  ripple  transform  of 
an  arbitrary  profile  should  be  comparable,  respectively,  to  the  frequency-difference- 
limen  (fdl)  and  phase-difference-limen  (pdl)  thresholds  measured  with  single  ripple 
profiles  [Green,  1986;  Hillier ,  1991;  Vranic-Sowers  and  Shamma ,  19xx]. 

These  predictions  are  tested  here  using  a  spectral  peak  profile  similar  to  a  vowel 
formant.  Subjects’  sensitivities  to  ripple  transform  shifts  are  estimated  from  their 
perceptual  thresholds  to  changes  in  several  peak  shape  parameters.  Results  are  then 
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compared  to  predictions  from  the  ripple  analysis  model  and  also  from  the  indepen¬ 
dent  channels  model  [Durlach,  Braida  and  Ito,  1986],  the  maximum  difference  model 
[Bernstein  and  Green ,  1987],  and  the  Ewaif  model  [Stover  and  Feth,  1983]. 

In  the  following  section  (Sec.  I),  the  acoustic  stimuli  are  described  in  detail  and 
their  ripple  transforms  are  computed.  In  Sec.  II,  changes  in  peak’s  bandwidth,  sym¬ 
metry,  and  amplitude  are  interpreted  in  the  ripple  transform  domain,  respectively, 
as  fdl-,  pdl-,  and  pedestal-type  experiments,  in  the  sense  defined  in  Part  I  of  this 
paper  [Vranic-Sowers  and  Shamma,  19xx],  and  detection  thresholds  for  these  three 
types  of  experiments  are  predicted  for  spectral  peak  profiles.  In  Sec.  Ill,  experimen¬ 
tal  results  are  reported  and  compared  to  predictions  of  the  ripple  analysis  model. 
Predictions  of  the  three  alternative  models  for  the  same  experiments  and  further 
general  discussion  of  the  results  are  presented  in  Secs.  IV  and  V,  respectively. 

I.  DESCRIPTION  OF  THE  PEAK  PROFILES 

A.  Spectral  peak  stimulus  parameters 

The  peak  profile  was  defined  as  the  envelope  of  a  broadband  stimulus  consisting 
of  N  components  equally  spaced  on  a  logarithmic  frequency  axis  (uj),  where  u>  = 
log2  [f/fo )  octaves,  /  is  the  frequency  (in  kHz),  and  fa  is  the  frequency  of  the  largest 
peak  component  (Fig.  1(a)).  The  stimulus  spectrum  consisted  of  two  portions,  the 
base  and  the  peak.  The  base  components  were  all  equal  in  amplitude  and  they  were 
added  in  phase  to  peak  components  of  different  symmetries  and  bandwidths  to  form 
the  peak  profiles.  The  peak  profile  is  defined  in  terms  of  the  following  parameters 
(Fig.  1(a)): 

•  u0  is  the  location  of  the  peak’s  maximum. 

•  S  is  the  slope  of  the  profile  near  the  peak’s  maximum  (in  dB/octave).  For 
u>  <  u>0i  S  =  L  (the  left  slope),  and  for  u>  >  u>0,  S  =  R  (the  right  slope). 

•  b(u>)  —  b  is  the  flat  base  of  the  peak  profile. 

•  a(u>)  —  amax  ■  is  the  amplitude  of  the  peak  portion  of  the  profile. 

amax  is  the  maximum  amplitude  of  the  peak  profile  (at  u  =  u>0).  It  is  also 
defined  in  dB  as  Amax  =  20  log10(£lyii). 

Therefore,  the  overall  peak  profile  (on  a  linear  scale)  is  given  by: 

pH  =  b(co)  +  a((j)  =  b  +  amax 


and  on  a  dB  scale: 

PdB( u)  =  20  log10(6  +  amax10^"-^)  =  20  log10(6  (1  +  lO^^"-^)). 
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Frequency,  kHz 


Figure  1:  (a)  Peak  profile  plotted  on  a  linear  (top)  and  logarithmic  (bottom)  am¬ 
plitude  scale.  Peak  level  ( Amax )  is  15  dB,  and  BWF  =  0.1  octave  and  SF  =  0. 
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For  example,  the  peak  in  Fig.  1(a)  is  15  dB  in  level  ( Amax )  with  slopes  L  -  60 
dB/octave  and  R  =  —60  dB/octave  around  the  peak.  Note  that  around  u>0  the  peak 
profile  can  be  approximated  by: 

Pjb(w)  «  20  log10(6- 10^+^(a'“"Jo))  =  20  log10  b  +  Amax  +  S(u-u0), 

i.e.,  the  peak  has  approximately  a  triangular  profile  as  shown  in  Fig.  1(a)  (lower 
panel) . 

In  all  experimental  conditions,  standard  and  signal  consisted  of  TV  =  11,  21,  or  41 
zero  phase  components  equally  spaced  on  a  logarithmic  scale  between  0.2-5  kHz  (u> 
in  the  range  ±2.32  octave),  with  the  peak  always  centered  at  1  kHz  ( u>0  —  0  octave). 
The  amplitude  of  each  component  p,-  in  the  stimulus  can  be  computed  from: 

Pi  =  b  +  amax  10'(,_‘o),  for  i  <  i0 , 

and 

Pi  =  b+  amax  10r(,“‘o),  for  i  >  i0, 

where  i  is  the  component  index,  i0  is  the  index  of  the  largest  peak  component, 
/  =  (L/ 20)  •  ( M/N ),  r  =  (R/ 20)  •  ( M/N ),  and  M  is  the  frequency  range  of  the 
spectrum  in  octaves  (M  =  4.64  octave).  For  our  centered  peaks,  i0  =  (TV  +  l)/2. 

No  other  (than  zero)  phase  conditions  were  tested  since  numerous  previous  ex¬ 
periments  have  shown  that  phase  effects  on  signal  detection  are  minimal  ([Bernstein, 
Richards  and  Green ,  1987;  Green  and  Mason ,  1985]). 

B.  Peak  shape  parameters 

In  order  to  describe  its  shape  efficiently,  a  peak  profile  was  parametrized  uniquely 
in  terms  of  a  bandwidth  factor  (BWF)  and  a  symmetry  factor  (SF).  BWF  is  the 
bandwidth  of  the  peak  (measured  at  3  dB’s  below  the  peak)  and  is  defined  as: 
BWF  =  3  (1/L  -  1/R)  octave.  The  SF  represents  the  percentage  difference  of  the 
slopes  around  the  peak  and  is  defined  as:  SF  =  (L  +  R)  /  (L  -  R).  For  instance, 
the  peak  in  Fig.  1(a)  has  BWF  =  0.1  octave  and  SF  =  0.  Peaks  with  various  other 
BWF’s  and  SF’s  are  shown  in  Fig.  1(b)  covering  the  full  range  of  profiles  used  in  our 
experiments.  Conversely,  given  any  BWF  and  SF,  the  slopes  around  the  peak  can 
be  computed  as:  R  =  -6/(BWF  (1  +  SF))  dB/octave,  and  L  =  6/(BWF  (1  -  SF)) 
dB/octave.  A  third  parameter  -  the  peak  level  (Amax)  is  also  required  to  define  the 
peak  completely  with  respect  to  the  baseline. 

To  make  the  spectral  peaks  asymmetric,  they  were  always  tilted  towards  higher 
frequencies  (or  to  the  right).  This,  together  with  choosing  the  peak  frequency  at 
1  kHz  and  limiting  the  range  of  BWF  values  under  0.4  octave,  ensured  that  the  spec¬ 
tral  peaks  were  located  above  500  Hz  where  the  cochlear  frequency  axis  is  assumed 
largely  logarithmic.  This  is  an  important  consideration  since  the  peak  shapes  used 
were  explicitly  defined  in  terms  of  spectral  slopes  along  such  an  axis.  The  largest 
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p(co) 


BWF=0.1  BWF=0.2  BWF=0.4 


to,  octave  co,  octave  to,  octave 


Figure  1:  (b)  Various  peak  profiles  plotted  on  a  linear  amplitude  axis.  Columns 
share  the  same  BWF’s  and  rows  share  the  same  SF’s.  Corresponding  left  and  right 
slope  values  (in  dB/octave)  are  shown  for  each  case. 


5 


BWF  and  SF  values  were  also  constrained  by  the  need  to  ensure  that  the  profile 
substantially  decayed  before  reaching  the  edges  of  the  base  (at  0.2  and  5  kHz).  Fi¬ 
nally,  the  range  of  BWF  and  SF  values  tested  corresponds  to  those  that  might  be 
computed  from  the  spectral  envelope  of  speech  sounds,  as  shown  in  Fig.  1(c). 


Figure  1:  (c)  SF’s  (top)  and  BWF’s  (in  octaves,  bottom)  for  the  spectral  peaks  of 
a  naturally  spoken  vowel  “aw” . 


C.  Ripple  transform  of  the  peak  profile 

According  to  the  ripple  analysis  model  [Vranic-Sowers  and  Shamma ,  19xx],  the 
detection  of  peak  shape  changes  is  based  on  the  ripple  transform  representation  of  the 
peak.  In  order  to  predict  perceptual  thresholds  for  the  fdl-,  pdl-,  and  pedestal-type 
experiments,  we  need  to  examine  the  peak’s  ripple  transform,  r(.),  and  specifically 
how  BWF  and  SF  are  interpreted  in  this  domain. 

First,  the  ripple  spectrum  P(fi)  of  the  peak  profile  is  for  fi  >  0  cycle/octave 
computed  as: 

p(n)  =  \p(n)\eje  =  amax  2°  RWF  • 

V  ’  1  V  n  b  3  In  10  M 

1 

1  +  j  2ttO  20/(3  In  10)  SF  BWF  +  (ttO  20/(31n  10)  BWF)2(1  -  SF2)  ’ 
where  the  magnitude  of  the  ripple  spectrum  is: 


\pm  = 


® max  C  BWF _ 1 _ 

6  *  M  y/(l  +  fi2  (c  BWF)2(1  -  SF2))2  +  H2(2c  SF  BWF)2  ’ 
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c  =  20  7r / (3  In  10)  «  9.1  and  |P(0)|  =  1  +  af*  c  (Fig.  2(a))1.  Using  this 
expression,  the  magnitude  of  the  ripple  transform,  r(fi0),  is  computed  from  Eq.  (4) 
in  [  Vranic-Sowers  and  Shamma,  19xx]  and  displayed  in  Fig.  2(b)  for  two  symmetric 
peaks  with  different  BWF’s  (0.1  and  0.4  octaves).  Note  that  the  steepest  lowpass 
edges  of  the  ripple  transforms  (marked  by  the  dashed  lines)  are  located  roughly  at 
U0  =  0.85  and  3.3  cycle/octave. 

II.  THRESHOLD  PREDICTIONS  OF  THE  RIPPLE  ANALYSIS  MODEL 

A.  Predictions  for  fdl-type  measurements  (Dilation  thresholds) 

For  a  peak  p(u ),  a  relative  change  in  BWF  (^ypp')  is  equivalent  to  dilating 

the  peak  by  a  factor  a  =  — jhwv  t°  P(a  u>)-  ripple  transform  domain 

1+'BWF“ 

this  corresponds  to  translating  the  magnitudes  of  the  ripple  spectrum  and  ripple 
transform  of  a  peak  (Sec.  I  B  in  [Vranic-Sowers  and  Shamma ,  19xx])  by  log2  a 
octave  on  the  logarithmic  Q0  axis  (Figs.  2). 

Since  the  locations  of  the  steepest  lowpass  edges  of  the  ripple  transforms  of  all 
peaks  used  here  fall  in  the  range  f l0  =0.85-7  cycle/octave,  then  the  ripple  analysis 
model  (A  curve  in  Fig.  3(c)  in  [Vranic-Sowers  and  Shamma ,  19xx])  predicts  that 
subjects  should  detect  a  0.26  octave  shift  in  r(Cl0).  Since  we  always  increased  BWF 
in  our  experiments  (i.e.,  <*>BWF>  0  octave),  then  A  =  —0.26  octave  (=  log2  a),  i.e., 
a  =  0.83.  This  corresponds  to  qywp  “  0-2,  or  a  20%  increase  in  BWF  regardless 
of  the  standard  peak’s  bandwidth.  This  estimate  is  also  independent  of  the  peak’s 
symmetry  (SF)  since  the  latter  mostly  reflects  the  phase  (not  the  magnitude)  of  the 
ripple  transform  (see  discussion  below). 


1Note  that  these  expressions  can  be  simplified  for  the  range  of  parameters  used  in  our  experi¬ 
ments.  For  instance,  for  small  SF’s  ( SF 2  <C  0.5)  the  magnitude  is  independent  of  the  SF: 

|P('Q'i|  ~  0max  c  BWF _ 1 

1  V  11  b  7T  M  1  +  f22  (c  BWF)2' 

For  SF 2  <  1,  the  phase  function  is  given  by: 


tan(0(fi))  «  0(fi)  « 


g  (2c  BWF) 

1  +  fi2  (c  BWF)2 


which  becomes 

0(0)  »  SF 

for  0  ~  c  BWF ’  1,e'’  aroun<^  ^  =  0.3, 0.5, 1.1  cycle/octave  for  BWF=  0.4,  0.2,  0.1  octaves, 
respectively.  Note  that  these  are  the  fi’s  at  which  the  magnitude  of  the  ripple  transform  of  the 
peak  r(00)  is  maximum  (Fig.  2(b))  and  hence  where  the  bulk  of  the  profile  energy  lies.  The 
correspondence  above  between  the  0(CI)  and  SF  is  more  analytically  established  in  Appendix  A. 
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Q,  cycle/octave 


£2o,  cycle/octave 


Figure  2:  (a)  Ripple  spectra  of  peak  profiles  with  Amax  =  15  dB,  SF’s=0,  and  BWF’s 
=  0.1  and  0.4  octaves,  (b)  Ripple  transform  magnitudes,  r(fi0),  of  ripple  spectra  in 
(a).  The  effect  of  a  BWF  change  is  a  shift  (and  not  a  change  in  shape)  of  the  ripple 
transform  along  the  logarithmic  fi0  axis.  For  example,  a  four-fold  increase  in  BWF 
(from  0.1  to  0.4  octaves),  i.e.,  =  3  or  a  —  0.25  results  in  a  (A  =  log2  a  =)  2 

octave  downward  shift  in  r(J70). 
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Figure  3:  (a)  The  effect  on  a  change  in  the  symmetry  of  a  peak  profile  (BWF 
=  0.4  octave,  SF  =  0)  due  to  adding  constant  phases  (3°,  9°,  18°,  or  0.05,  0.16, 
0.31  radians)  to  its  Fourier  transform,  (b)  Changes  in  the  symmetry  of  a  peak 
(BWF=0.4  octave,  SF=0,  dashed  line)  due  to  adding  a  constant  phase  11.5°  =  0.2 
radians  (dotted  line)  and  due  to  changing  the  SF  by  <5SF  =  0.2  (solid  line). 
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B.  Predictions  for  pdl-type  measurements  (Ripple  phase  sensitivity) 

Adding  a  small  constant  phase  angle  to  all  components  of  the  peak  ripple  trans¬ 
form  r(-)  translates  the  phase  function  of  the  ripple  transform  and  changes  the  peak’s 
symmetry  or  SF,  as  demonstrated  in  Figs.  3.  In  Appendix  A  it  is  shown  that  £SF 
is  approximately  equal  to  the  ripple  phase  of  the  peak  profile  (in  radians).  Hence, 
in  all  measurements  reported  here,  the  peak’s  ripple  phase  was  manipulated  simply 
by  changing  the  SF  directly  (i.e.,  the  slopes  L  and  R  around  the  peak)  rather  than 
via  forward- inverse  ripple  transform  operations. 

The  ripple  phase  sensitivity  for  peak  profiles  can  be  predicted  from  the  ripple 
analysis  model  as  follows.  For  the  broader  peaks  (BWF  >0.1  octave)  the  bulk 
of  the  ripple  transform  (Fig.  2(b))  lies  in  the  low  ripple  range  (<  1  cycle/octave). 
Therefore,  phase  sensitivity  is  predicted  to  be  constant  at  approximately  6°  or  0.1 
radians  (Fig.  4(b)  in  Part  I,  the  25  dB  curve).  Equivalently,  subjects  should  be  able 
to  detect  a  SF  change  (^SF)  of  approximately  0.1.  This  threshold  estimate  would 
increase  for  the  narrowest  peak  (BWF  =  0.1  octave)  as  the  ripple  transform  shifts 
upwards  towards  the  range  of  >  1  cycle/octave.  Note  that,  as  for  the  fdl-  type 
tests,  none  of  these  predictions  depend  on  the  standard’s  peak  symmetry. 

C.  Predictions  for  pedestal-type  measurements  (Peak  amplitude  sensitiv¬ 
ity) 

The  ripple  analysis  model  is  basically  linear  and  hence  pedestal-type  experiments 
should  yield  comparable  thresholds  regardless  of  the  exact  shape  of  the  spectral  pro¬ 
file.  The  only  data  previously  available  for  such  experiments  used  a  single  component 
pedestal  profile  (Fig.  5.4  in  [Green,  1988]).  Specifically,  thresholds  ranged  between 
-8  and  -16  dB’s  for  pedestals  up  to  24  dB  larger  than  the  base.  The  variability  is 
attributed  to  masking  effects  depending  on  the  spectral  density  in  the  vicinity  of 
the  signal  [Green,  1988]. 

III.  MEASURED  THRESHOLDS  FOR  PEAK  SHAPE  CHANGES 
A.  Methods 

The  experiments  are  similar  in  methodology  to  previously  reported  profile  analy¬ 
sis  experiments  ([ Bernstein  and  Green,  1987;  Bernstein,  Richards  and  Green,  1987; 
Green,  Mason  and  Kidd,  1984;  Kidd,  Mason  and  Green,  1986]).  The  specifics 
of  the  experimental  procedures  used  are  identical  to  those  described  in  Part  I 
(Sec.  II  B.2(i))  of  this  paper  [Vranic-Sowers  and  Shamma,  19xx].  Again,  the  wave¬ 
form  was  turned  on  10  ms  following  the  onset  in  order  to  suppress  the  large  ampli¬ 
tudes  due  to  zero  phases. 

The  step  size  was  defined  in  terms  of  a  change  in  the  right  slope  of  the  peak  and  it 
differed  across  the  testing  conditions,  ranging  from  0.3  dB/octave  -  5.5  dB/octave. 

Threshold  measure  was  defined  as  the  amount  of  change  in  BWF  or  SF  needed  for 
detection  (for,  respectively,  a  constant  SF  or  BWF  testing  condition),  i.e.,  <5BWF  or 


10 


<5SF.  In  the  case  of  BWF  change  tests,  the  normalized  by  the  peak’s  BWF  thresholds 
were  used  (i.e.,  <5BWF/BWF). 

The  results  reported  are  based  on  data  from  two  to  five  normal  hearing  subjects, 
depending  on  the  particular  test. 

B.  Detection  of  change  in  bandwidth  (Dilation  thresholds) 

Experiments  here  measured  detectability  of  a  dilation  (stretching)  of  the  spectral 
peak  or,  equivalently,  of  an  increase  in  its  BWF,  while  holding  the  peak  symmetry 
constant.  To  recognize  the  signal,  subjects  reported  listening  to  several  different 
sound  qualities,  e.g.,  pitch  and  sharpness  of  sound.  Some  reported  changing  strate¬ 
gies  depending  on  the  testing  conditions. 

In  the  parameter  range  tested,  dilation  thresholds  remained  relatively  constant 
at  a  «  0.8  (or  <$BWF/BWF  «  0.22)  regardless  of  standard’s  peak  shape  (Figs.  4). 
This  result  is  consistent  with  predictions  of  the  ripple  analysis  model  discussed  above 
in  Sec.  II  A. 

Thresholds  only  slightly  decreased  for  higher  peak  amplitudes  (i.e.,  >  15  dB  in 
level),  but  increased  sharply  for  lower  amplitudes  perhaps  due  to  masking  effects  of 
the  base  upon  the  smaller  peak  profiles  (Appendix  B.l,  Fig.  Bl).  Thresholds  were 
also  relatively  independent  of  spectral  component  density  (Fig.  B2). 

Two  conclusions  can  be  drawn  from  the  above  results.  First,  the  constancy  of 
the  dilation  thresholds  in  the  peak  profiles  is  directly  related  to  the  constancy  of  the 
fdl’s  in  the  0.85  —  7  cycle/octave  range  (A  curve  in  Fig.  3(c)  in  [Vranic-Sowers  and 
Shamma,  19xx]).  Second,  the  measured  dilation  thresholds  hold  regardless  of  the 
exact  details  of  the  peak  shape  since  the  thresholds  depend  only  on  the  shift  and 
not  on  the  shape  of  the  ripple  transform  magnitude  (Sec.  II  B.l  in  [Vranic-Sowers 
and  Shamma,  19xx]). 

C.  Detection  of  change  in  symmetry  (Ripple  phase  sensitivity) 

These  experiments  measured  subjects’  sensitivity  to  a  change  in  peak  symmetry 
while  holding  its  BWF  constant  or,  equivalently,  a  shift  in  the  phase  of  the  ripple 
transform  of  the  peak  profile.  Subjects  trained  relatively  quickly  to  distinguish  signal 
from  standard,  reportedly  based  on  pitch-like  cues.  This  “pitch”  effect  is  further 
explored  in  Sec.  IV  C. 

In  the  parameter  range  tested,  ripple  phase  sensitivity  remained  relatively  con¬ 
stant  at  approximately  6°  (or  6SF  0.11)  regardless  of  the  peak  shape  (Figs.  5). 
There  was,  however,  a  consistent  small  increase  in  threshold  as  a  function  of  de¬ 
creasing  BWF  (Fig.  5(b)),  especially  at  the  narrowest  peaks  (BWF=  0.1  octave) 
where  thresholds  rose  to  10°  (5SF  ~  0.17). 

Thresholds  changed  only  slightly  for  different  peak  amplitudes  and  spectral  com¬ 
ponent  densities  (Figs.  B3  and  B4,  Appendix  B.2).  The  major  exception  was  the 
narrowest  peak  where  decreasing  spectral  density  caused  detection  thresholds  to  rise 
significantly. 
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Figure  4:  Bandwidth  factor  change  detection  thresholds,  6BWF/BWF,  for  41  fre¬ 
quency  components  and  15  dB  peak  level  (which  allowed  the  peak  to  be  heard 
clearly),  are  averaged  over  three  listeners  and  three  BWF’s  (0.1,  0.2,  0.4  octaves) 
in  (a)  and  five  SF’s  (0,  0.1,  0.15,  0.2,  0.4)  in  (b).  The  error  bars  are  the  standard 
deviations  of  the  means. 
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Figure  5:  Symmetry  factor  change  detection  thresholds,  <5SF,  for  41  component 
complex  and  15  dB  peak  amplitude,  are  averaged  over  five  subjects  and  four  BWF’s 
(0.1,  0.13,  0.2,  0.4  octaves)  in  (a)  and  five  SF’s  (0,  0.1,  0.15,  0.2,  0.4)  in  (b).  Note 
the  6SF  increase  for  the  narrowest  BWF  in  (b).  The  error  bars  are  the  standard 
deviations  of  the  means. 
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Ripple  phase  sensitivity  measurements  shown  here  are  similar  to  those  previously 
obtained  with  ripple  profiles  (Sec.  II  B.2(iii)  in  Part  I  [Vranic-Sowers  and  Shamma , 
19xx]),  in  that  both  exhibit  a  constant  minimum  threshold  of  approximately  6°  at 
low  ripple  frequencies  (or,  equivalently,  for  broad  peaks)  which  gradually  rises  at 
higher  ripples  (or  narrow  peaks).  These  results,  therefore,  are  consistent  with  the 
predictions  of  the  ripple  analysis  model. 

D.  Detection  of  change  in  peak  amplitude  (Peak  amplitude  sensitivity) 

Thresholds  for  a  change  in  the  amplitude  of  the  peak  (test  A  in  Fig.  6)  were 
measured  in  the  range  -7  to  -10  dB’s  (values  are  given  in  the  parentheses  in  Fig.  6) 
for  all  peak  standards.  While  they  were  independent  of  the  SF’s,  thresholds  were 
slightly  lower  for  the  narrowest  peaks  (BWF  =  0.1  octave).  These  thresholds  are 
comparable  to  those  obtained  with  single  component  pedestals.  They  are  also  con¬ 
sistent  with  the  notion  that  this  linear  ripple  analysis  model  (which  does  not  take 
into  account  explicitly  the  masking  effects)  may  be  an  adequate  first-order  approx¬ 
imation  for  pedestal- type  experiments  with  larger-than-base  pedestal  profiles. 

This  experiment  also  serves  in  a  different  way  as  a  control  to  rebut  the  argument 
that  dilation  thresholds  may  simply  reflect  the  detection  of  a  change  of  total  peak 
profile  energy,  rather  than  of  bandwidth  per  se.  This  is  because  in  all  dilation  tests 
(Sec.  Ill  B)  the  energy  was  not  equalized  as  the  peak  bandwidth  was  altered.  In  this 
experiment,  BWF  was  kept  constant  as  the  peak  amplitude  increased.  So,  if  the  two 
tasks  were  based  on  energy  change,  then  the  rms-thresholds  should  be  comparable. 
The  plots  in  Fig.  6  demonstrate  that  this  is  not  the  case,  since  rms-thresholds 
for  dilation  tests  are  uniformly  lower  by  approximately  6  dB’s,  i.e.,  subjects  are 
specifically  sensitive  to  a  change  in  peak  bandwidth. 

IV.  THRESHOLD  PREDICTIONS  BY  OTHER  PROFILE  ANALYSIS 
MODELS 

In  this  section  detection  thresholds  measured  for  peak  and  ripple  profiles  are 
compared  to  predictions  from  three  different  models  of  profile  analysis:  the  inde¬ 
pendent  channels  model  [Durlach,  Braida  and  Ito ,  1986],  the  maximum  difference 
model  [Bernstein  and  Green ,  1987],  and  the  Ewaif  model  [Feth,  O’Malley  and  Ram¬ 
sey ,  1982;  Stover  and  Feth ,  1983].  The  first  two  models  operate  on  the  profile  directly 
and  assume  that  profile  changes  are  conveyed  by  independent  channels  distributed 
across  the  spectrum.  The  third  model  bases  detection  on  a  derived  “pitch-like” 
measure  (the  Ewaif  pitch)  from  the  profile.  The  validity  of  predicted  thresholds  is 
judged  based  on  the  constancy  of  the  d'a  index  (<r  is  the  variance  of  the  channels, 
see  below)  and  the  Ewaif  pitch  computed  at  perceptual  thresholds. 

A.  The  channel  model 

This  model  is  described  in  detail  in  [Durlach,  Braida  and  Ito ,  1986;  Green ,  1988]. 
It  combines  information  from  N  independent  noisy  channels  whose  variances  (a) 
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Figure  6:  Detection  thresholds  for  a  peak  level  change  (test  A)  compared  to 
a  BWF  change  (test  B).  The  rms-threshold  for  the  two  tests  is  defined  as 

20  log  \/EiIi(Ap!7pi)2,  where  A p,  is  the  change  in  the  amplitude  of  the  ith  com¬ 
ponent  at  threshold,  pi  is  the  amplitude  of  the  ith  component  in  the  standard,  and 
N  is  the  number  of  components.  The  values  in  the  parentheses  are  computed  as 
20  log  ApN+i/i/pN+i/2,  where  pn+i/2  is  the  central  component  of  the  pedestal  stan¬ 
dard.  Three  subjects  participated  in  this  series  and  were  tested  at  two  SF’s  (0,  0.4) 
and  three  BWF’s  (0.1,  0.2,  0.4  octaves),  for  a  41  component  complex  and  at  15  dB 
standard  peak  level. 
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are  assumed  to  be  constant.  Some  interdependence  between  the  channel  outputs 
is  introduced  because  of  the  level  randomization  in  the  experiments.  The  uniform 
roving  level  distribution  over  a  20  dB  range  (07?  =  5.6)  is  approximated  by  a  normal 
distribution  of  ctr  =  5.  Furthermore,  it  is  assumed  that  the  channel  variances  are 
such  that  <jr  •  N  a.  The  level  difference  between  the  standard’s  and  signal’s 
ith  component  is  defined  as  A,-  ==  20  log((pt)  signal /(pi)  standard)-  These  assumptions 
lead  to  d'  =  ^/(£3A2  ~  ^f(I3  A,-)2) / <r.  The  numerator  (  or  d'a)  was  computed  at 
perceptual  thresholds  for  different  testing  conditions  (Tables  I)  and  at  the  limits  of 
the  error  bars,  in  order  to  determine  its  sensitivity  to  threshold  changes.  Since  no 
explicit  a  values  are  used,  absolute  threshold  values  will  not  be  predicted.  Instead, 
we  shall  be  mostly  concerned  with  the  trends  in  the  data  and  with  the  relative 
threshold  values  for  different  profiles. 

For  ^BWF  tests  all  d'a' s  (with  one  exception)  are  comparable  when  considering 
the  significant  overlap  due  to  the  error  bars  (Table  1(a)).  Thus,  the  model  predicts 
that  dilation  thresholds  for  peak  profiles  are  constant  regardless  of  the  shape  of  the 
standard  peak.  This  is  consistent  with  the  measured  thresholds. 

For  £SF  tests  the  stimuli  are  “balanced”  (see  [Durlach,  Braula  and  Ito,  1986]) 
in  that  ]FA{  w  0  or  at  least  (13  A;)2  <C  N^A?.  Here,  the  channel  model  makes 
two  erroneous  predictions  (Table  1(b)):  (1)  As  a  function  of  SF  it  predicts  a  small 
consistent  decrease  in  thresholds  that  is  not  evident  in  the  data;  (2)  As  a  function  of 
BWF  it  predicts  that  phase  sensitivity  improves  with  increasing  BWF’s,  rather  than 
being  constant  for  broad  peaks  (BWF=0.2  and  0.4  octaves)  as  the  data  indicate. 

Interestingly,  the  model  is  able  to  predict  a  constant  phase  sensitivity  (Table  1(c)) 
for  low  frequency  ripples  (<  1  cycle/octave)  when  applied  to  single  ripple  profiles 
discussed  in  Sec.  II  B.2  in  [Vranic-Sowers  and  Shamma,  19xx].  It  is  unclear  why 
this  constancy  of  thresholds  is  not  seen  with  the  peak  profile  predictions2. 

B.  The  maximum  difference  model 

The  maximum  difference  model  described  in  [Bernstein  and  Green ,  1987]  is 
based  on  detecting  the  largest  difference  between  any  pair  of  components  in  the 
signal,  i.e.,  it  uses  only  two  channels  in  computing  the  thresholds.  The  model 
was  derived  from  experimental  results  with  flat  standards  and  was  defined  ac¬ 
cordingly  for  such  tests.  It  predicts  well  the  thresholds  in  a  number  of  profile 
analysis  tasks  [Bernstein  and  Green ,  1987].  In  order  to  apply  the  model  to  the 
peak  stimuli  the  computational  scheme  was  slightly  modified.  For  instance,  we 
define  the  level  difference  between  the  ith  and  jth  frequency  component  as  A,j  = 

20  log  ((pi)  signal  j  (Pi)  standard)  20  log  ((Pj)  signal  /  (Pj)  standard)  •  Also,  Contrary  to  the 

assumption  of  the  original  model  ([Bernstein  and  Green ,  1987]),  we  take  the  <r’s  to 

2The  d  a'  values  were  also  computed  using  only  41  components,  or  in  effect  41  independent 
channels.  They  were  about  half  of  the  values  reported  in  Table  1(c)  for  161  components  and  are, 
therefore,  comparable  to  those  for  peak  profiles. 
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<f(T  for  5BWF  test 


11  Wlf  (octave) 

SF  0.1  0.2  0.1  average 


0 

3.11  ±0.21 

3.02  ±0.19 

2.58  ±0.21 

2.90 

0.1 

3.12  ±0.25 

3.16  ±0.20 

3.27  ±0.25 

3.18 

average  3.11  3.09  2.92 

6BWF/BWF  21.5  ±1.8%  20.0  ±1.1%  23.6  ±2.1% 

threshold 


& <7  for  tfSi1'  test 
11 WF  (octave) 


SF 

0.1 

0.2 

0.1 

average 

0 

2.%  ±0.19 

2.88  ±0.13 

3.97  ±0.18 

3.27 

0.1 

3.36  ±0.22 

3.22  ±0.15 

1.33  ±0.20 

3.61 

average  3.16  3.05  1.15 

5SF  0.16  ±0.01  0.11  ±0  005  0.11  ±0.005 
threshold 


(t<r  for  phase-ripple  experiment 

ripple  frequency  (cycle/octave) 

0.25  0.5  1  2  1  8 


d'a 

9.59 

10.11  8.89 

9.17 

19.0 

19.58 

pdl.  15  dll 

8.09“ 

9.08“  7.53“ 

8.1“ 

16.22“ 

13.17“ 

d'a  10.67  10.20  10.08  11.32  28.39 

pdl.  25  dll  5.59*  5.31“  5.11“  7.31“  11.53“ 


Table  I:  d'a  (dB)  values  for  the  “independent  channels  model”  (Sec.  IV  A),  (a) 
d! a  for  <SBWF  tests  are  evaluated  at  threshold  and  error  bar  limit  values  which  are 
given  at  the  bottom  of  each  table.  (For  example,  for  BWF  =  0.1  octave  and  SF= 
0,  £BWF/BWF  =  21.5%  with  error  bar  limits  of  ±  1.8%,  and  the  corresponding 
d'a  =  3.11  ±0.24.)  Thresholds  are  taken  from  Fig.  4(b)  (41  components  and  15  dB 
peak  level),  (b)  d'a  values  for  <5SF  tests.  The  table  is  organized  as  Table  1(a)  with 
threshold  values  from  Fig.  5(b)  (41  components  and  15  dB  peak  level),  (c)  d'a  for 
pdl  tests  with  ripple  profiles  for  15  dB  and  25  dB  ripple  levels  and  thresholds  from 
Fig.  4(b)  in  Part  I  (161  components).  (The  d'a  values  for  41  components,  for  ripple 
frequencies  between  0.25  -  4  cycle/octave,  are  half  of  the  values  reported  here  for 
161  components.) 
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be  constant  for  all  channels  and,  hence,  the  largest  d!  is  defined  by  the  largest  AhJ 
(defined  as  A). 

For  6BWF  tests  A  is  approximately  constant  for  all  SF’s  and  BWF’s  (Table 
11(a)).  Therefore,  the  model  predicts  well  the  perceptual  trends  assuming  constant 
<x’s  across  all  spectral  regions.  If  <r’s  are  allowed  to  increase  towards  the  edges  (as 
in  [Bernstein  and  Green ,  1987]),  d!  would  decrease  with  increasing  BWF  predicting 
erroneously  higher  thresholds  for  these  conditions. 

For  <$SF  tests  the  model  makes  two  erroneous  predictions  (Table  11(b)):  (1)  Like 
the  channel  model  it  predicts  a  small  consistent  decrease  in  thresholds  as  a  function 
of  SF  that  is  not  evident  in  the  data;  (2)  It  predicts  a  constant  phase  sensitivity  as 
a  function  of  BWF  for  all  peaks,  i.e.,  it  does  not  account  for  the  increase  in  (*)SF 
thresholds  for  the  narrowest  peaks  (BWF  =  0.1  octave). 

The  model  is  also  able  to  predict  a  constant  phase  sensitivity  for  low  frequency 
ripples  (<  1  cycle/octave)  when  applied  to  the  single  ripple  (Table  11(c)).  Therefore, 
unlike  the  channel  model  above,  this  model  consistently  predicts  the  constant  phase 
property  for  both  the  ripple  and  peak  profiles.  The  failure  of  the  model  to  predict 
the  declining  phase  sensitivity  at  high  ripple  frequencies  is  likely  because  it  does  not 
explicitly  incorporate  data  such  as  the  ripple-pdl  in  Fig.  4(b)  in  [  Vranic-Sowers  and 
Shamma ,  19xx]. 

C.  The  Ewaif  model 

Numerous  experimental  results  have  suggested  that  the  detection  of  spectral 
shape  changes  may  in  some  cases  be  effectively  mediated  by  pitch  cues  associated 
with  these  spectral  changes  ([Berg,  Nguyen  and  Green ,  1992;  Feth,  O’Malley  and 
Ramsey,  1982;  Richards,  Onsan  and  Green,  1989;  Stover  and  Feth ,  1983]).  In  order 
to  assess  the  possible  contribution  of  such  a  pitch  cue  in  our  tests,  we  measured 
the  effect  on  thresholds  of  randomizing  peak  locations  -  a  procedure  which  in  effect 
destroys  the  pitch  cue.  The  change  in  thresholds  was  then  compared  to  what  would 
be  predicted  from  the  theoretical  strength  of  the  pitch  cue  computed  for  each  test 
using  the  so-called  Ewaif  model  [Stover  and  Feth,  1983]. 

1.  Stimulus 

The  entire  spectral  content  was  randomly  shifted  in  order  to  prevent  listeners 
from  using  standard’s  and  signal’s  complex  pitches  for  spectral  shape  change  de¬ 
tection.  Frequency  shift  was  achieved  by  randomly  changing  the  sampling  time  in 
a  range  of  40  ys  to  45  ys  in  steps  of  0.5  ys.  This  amounts  to  shifting  the  central 
component  from  1000  Hz  to  889  Hz  and  all  the  other  components  accordingly  to 
preserve  the  frequency  spacing. 

The  measured  thresholds  and  the  corresponding  Ewaif  pitches  are  presented  in 
Tables  III.  The  Ewaif  pitches  were  computed  for  zero  phases  which  corresponds  to 
our  stimulus  condition. 


18 


A  for  flJWF  test 


BWF  (octave) 


SF 

0.1 

0.2 

0.1 

average 

0 

1.1.1 

1.37 

1.39 

1.16 

0.1 

1.1.1 

1.37 

1.39 

1.17 

average 

1.11 

1.37 

1.39 

flJWF/BWF 

threshold 

•x% 

23.6% 

A  for  *$F  test 
BWF  (octave) 


SF 

0.1 

0.2 

0.1 

average 

0 

2.39 

1.63 

1.66 

1.90 

0.1 

3.07 

2.01 

2.09 

2.10 

average 

2.73 

1.81 

1.87 

3SF 

threshold 

0.16 

0.11 

0.11 

A  for  phase-ripple  experiment 

ripple  frequency  (cvcle/octave) 

0.23  0.3  1  2  1  8 

A 

2.12 

2.38  1.97  2.12 

1.2.1  11.11 

pdl.  13  dll 

8.09" 

9.08°  7.33*  8.1* 

16.22"  13.17" 

A 

2.33 

2.32  2.23  3.16 

6.32 

pdl,  23  dll 

3.39* 

3  31"  3.11"  7.31* 

11.33" 

Table  II:  Maximal  difference  levels,  A  (dB),  for  the  “maximal  difference  model” 
(Sec.  IV  B).  Tables  are  organized  as  Tables  I. 
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2.  Assessing  the  data  using  the  Ewaif  model 

In  order  to  assess  the  amount  of  a  pitch  cue  contribution  to  the  detection  of 
changes  in  our  stimulus,  the  following  two  arguments  (from  [ Richards ,  Onsan  and 
Green ,  1989])  are  used:  1)  If  the  detection  process  relies  primarily  on  a  pitch  cue 
(as  defined  by  the  Ewaif  model),  then  some  minimal  pitch  difference,  AF  ([Feth  and 
Stover,  1987]),  or  relative  pitch  difference,  AF/Fa<a  ([Richards,  Onsan  and  Green, 
1989]),  is  necessary  for  detection.  Therefore,  AF  or  AF /Fsia  at  perceptual  thresholds 
should  remain  relatively  constant. 

2)  If  a  threshold  deterioration  occurs  due  to  the  uncertainty  in  the  randomized 
signal  and  not  due  to  the  pitch  differences  across  the  testing  conditions,  then  it 
should  be  uniform  across  all  conditions.  Otherwise,  the  deterioration  probably  re¬ 
flects  the  effective  contribution  of  the  pitch  cue.  This  is  evaluated  by  the  change  in 
values  of  the  NR-R  in  Tables  III. 

3.  Results  and  discussion 

(i)  Effects  on  detection  of  BWF  changes  ( Table  111(a)) 

The  AF  and  AF /F sta  values  vary  greatly  (approximately  7-fold)  across  the  SF’s 
and  BWF’s.  Note  also  a  change  in  sign  of  AF  across  various  testing  conditions. 
This  strongly  suggests  that  the  pitch  cue  plays  a  minimal  role  in  this  discrimination 
task.  Furthermore,  a  near  uniform  increase  of  the  thresholds  when  the  signal  is 
randomized  supports  the  notion  that  it  is  due  to  an  uncertainty  effect  rather  than 
an  abolishment  of  a  pitch  cue. 

(ii)  Effects  on  detection  of  SF  changes  (Table  111(b)) 

With  respect  to  the  first  argument  above,  it  is  clear  from  the  AF  and  AF/Fs(a 
values  in  Table  111(b)  that  not  all  pitch  cues  are  equal  at  threshold,  since  both  in¬ 
crease  approximately  4-fold  over  the  SF’s  and  BWF’s  tested.  However,  the  rise  in 
6SF  for  the  narrowest  peak  might  be  due  to  decreasing  pitch  cue.  This  is  further 
supported  by  the  data  with  respect  to  the  second  argument,  namely  that  the  ran¬ 
domization  affects  only  the  <$SF  thresholds  of  the  narrower  peaks.  Therefore,  the 
evidence  here  suggest  that  the  pitch  cue  may  be  effective  only  for  these  peaks. 

V.  DISCUSSION 

We  measured  subjects’  ability  to  detect  changes  in  the  shape  of  peak  profiles 
under  various  conditions.  Specifically,  our  goal  was  to  demonstrate  that,  for  profiles 
that  satisfy  the  two  constraints  outlined  in  Sec.  IV  B  of  [  Vranic-Sowers  and  Shamma, 
19xx],  thresholds  for  dilating  the  profile,  shifting  its  ripple  phase,  or  changing  its 
overall  amplitude,  are  rather  independent  of  the  details  of  the  profile.  More  gen¬ 
erally,  these  manipulations  can  be  interpreted  and  thresholds  predicted  within  the 
unified  framework  of  the  ripple  analysis  model. 
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0.1 

BWF  (octave) 

0.2 

0.4 

d>BWF  test 

0 

0.4 

SF 

0 

0.4 

0 

0.4 

NR 

0.30 

0.31 

0.32 

0.25 

0.23 

0.23 

R 

0.43 

0.42 

0.62 

0.53 

0.37 

0.36 

-(NR  H) 

0.13 

0.11 

0.30 

0.28 

0.14 

0.13 

^  ita 

12m  52 

1327.40 

1227.73 

1329.10 

1223.27 

1430.59 

1243.44 

1318.03 

1216.00 

1377.66 

1255.29 

1505.61 

^=F|(a'F  nj) 

38.65 

9.66 

11.69 

-49.96 

-26.78 

-73.16 

AF/F„„  •  100 

2.99 

0.73 

0.95 

-3  70 

-2.19 

-5.11 

0.1 

BWF  (octave) 

0.2 

0.4 

$SF  test 

0 

04 

SF 

0 

0.4 

0 

0.4 

NR 

0  27 

0.27 

0.13 

0.15 

0.13 

0.11 

R 

0  36 

0.44 

0.27 

0.23 

0  17 

0.12 

-(NR  R) 

0.09 

0.17 

0.14 

0.08 

0.04 

0.01 

F  «ta 

1290  ,12 

1327.40 

1227.73 

1329.10 

1223  27 

1430.59 

F IIQ 

1315.58 

1345.59 

1258.54 

1369.30 

1293.42 

1495.99 

AF=F.,„-F,„, 

-25  04 

-18.19 

-30.83 

-40.25 

-70.13 

-65.39 

AF/F.,„  ■  100 

-1.94 

-1  37 

-2  .U 

-3.03 

-.1.73 

-4.57 

Table  III:  (a)  £BWF/BWF  and  (b)  6SF  detection  thresholds  for  non-randomized 
(NR)  and  randomized  (R)  spectra  (Sec.  IV  C).  The  tables  are  organized  in  the 
same  way.  The  first  two  rows  are  the  NR  and  R  thresholds.  The  third  row  is  the 
difference  of  the  first  two.  The  forth  and  fifth  rows  are  the  computed  Ewaif  pitches 
of  standard  (Fsio)  and  signal  (Fs,-5)  at  perceptual  threshold  levels  for  NR  condition 
and  zero-phase  components.  The  AF  row  is  AF  =  Fs<a-F5lff.  The  last  row  is  the 
relative  pitch  difference,  AF/Fsta  in  percentage.  Note  in  (a)  a  change  in  sign  of  AF 
across  various  testing  conditions,  which  may  explain  the  change  in  strategies  that 
our  subjects  reported  in  performing  £BWF  task. 

Three  subjects  participated  in  <5BWF  and  two  in  £SF  series.  They  were  tested  at 
two  SF’s  (0,  0.4)  and  three  BWF’s  (0.1,  0.2,  0.4  octaves)  for  41  spectral  density 
complexes  and  at  15  dB  peak  level. 
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The  results  obtained  seem  on  the  whole  to  confirm  these  hypotheses.  Thus,  dila¬ 
tion  thresholds  for  peak  and  single  ripple  profiles  (fdl’s  )  are  similar  (Secs.  Ill  B  and 
II  A,  respectively).  Thresholds  for  changes  in  peak  symmetry  are  also  readily  inter¬ 
pretable  as  ripple  phase  sensitivity  and  are  close  to  those  obtained  from  changing 
the  phase  of  ripple  profiles  (Secs.  Ill  C  and  II  B,  respectively). 

Also  examined  in  this  paper  is  the  ability  of  other  profile  analysis  models  to  ac¬ 
count  for  the  measured  threshold  values  and  trends.  The  channel  model  fails  mostly 
in  the  phase  sensitivity  tests  (£SF  and  pdl  test  for  ripple  profiles),  where  its  predic¬ 
tion  of  trends  are  at  variance  with  the  measured  results.  The  maximum  difference 
model  faired  the  best  among  the  three  alternatives  considered.  On  the  whole  it  pre¬ 
dicts  correctly  the  trends  in  all  dilation  thresholds  and  most  phase  sensitivity  tests. 
The  most  obvious  flaw  is  its  inability  to  predict  the  deterioration  of  phase  sensitiv¬ 
ity  for  higher  ripple  frequencies  and  narrow  peak  profiles.  It  is  conceivable  that  the 
model  can  be  modified  to  take  these  measurements  into  account.  For  example,  a 
suitable  smoothing  of  the  spectral  profile  may  reduce  the  effective  amplitude  of  the 
higher  ripple  components,  in  effect  raising  their  thresholds.  Finally,  the  Ewaif  model 
cannot  account  for  the  <*)BWF  thresholds  and  accounts  only  partially  for  the  ASF 
measurements.  Clearly,  these  criticisms  apply  only  to  the  specific  formulations  of 
the  models  considered  here  and  it  is  possible  that  future  modifications  will  improve 
their  predictions  significantly. 

Note,  however,  that  the  maximum  difference  model  was  originally  developed 
specifically  to  predict  idl-type  tests  and  not  the  fdl-  and  pdl-type  tests  discussed 
here.  Therefore,  a  better  comparison  of  the  ripple  analysis  and  the  maximum  differ¬ 
ence  models  might  be  to  account  for  the  detection  threshold  of  a  profile  composed 
of  several  ripples  that  do  not  appreciably  interact  within  the  same  filter  (e.g.,  sep¬ 
arated  by  more  than  an  octave).  For  different  phases  of  the  ripples  the  shape  of 
the  profile  changes  and  so  too  the  predicted  thresholds  of  the  maximum  difference 
model.  In  contrast,  the  ripple  analysis  model  predicts  that  detection  thresholds  are 
independent  of  the  relative  phases  of  the  ripples.  A  specific  example  of  such  a  pro¬ 
file  is  the  square  wave  which  is  composed  of  a  large  fundamental  ripple  and  smaller 
odd  harmonics.  The  maximum  difference  model  predicts  the  threshold  based  on  the 
amplitude  of  the  square  wave,  whereas  the  ripple  analysis  model  predicts  it  based  on 
the  amplitude  of  the  fundamental  (largest)  ripple  component.  The  two  amplitudes 
differ  by  a  small  (hopefully  measurable)  factor  of  4/w. 

In  summary,  the  ripple  analysis  model  is  able  to  integrate  efficiently  data  from 
a  diverse  set  of  profile  analysis  tests.  It  also  provides  a  descriptive  framework  and  a 
computationally  tractable  formulation  to  conceive  of  and  test  new  hypotheses.  But 
perhaps  the  most  important  impetus  for  pursuing  this  type  of  profile  analysis  model 
is  the  recent  physiological  findings  on  the  functional  organization  of  the  auditory 
cortex  [Calhoun  and  Schreiner ,  1993;  Shamma,  Versnel  and  Kowalski ,  1993].  These 
results  support  the  notion  that  the  auditory  system  performs  a  ripple  analysis  of 
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the  spectral  profile  via  ripple  frequency  and  phase  selective  filters.  The  linear  ripple 
analysis  model  presented  here  is  a  simplified  formulation  of  this  type  of  auditory 
processing. 
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Appendix  A  Adding  a  constant  phase  to  the  rip¬ 
ple  spectrum  of  the  profile 


Consider  the  profile  p(o>)  whose  Fourier  transform  is  P(Cl): 

1  r°°  ~ 

p(oj)  =  -! -  /  p(ny2*n"d( 2nn). 

Z7T  J — oo 

Adding  a  constant  phase  angle  90  to  all  the  transform  components  changes  the  profile 
to: 

/0  too 

P(n)eje°ej2irQwdCl  +  /  P(f))e"JV2irnu,dft, 

-oo  J  0 

where  the  integral  is  split  to  emphasize  that  the  phase  function  (added  to  negative 
frequencies  and  subtracted  from  positive  frequencies)  must  be  odd  as  a  function 
of  Cl  in  order  for  pBo  to  remain  real.  This  expression  can  be  simplified  further  by 
substituting  e±J(?0  =  cos(0o)  ±  j  sin(0o),  and  collecting  terms: 


pBo(uj)  =  cos(0o)  r  P(n)e32*Q“dCl  -  sin(#0)  r  jP(n)  •  sign{Cl )  •  ej2*Q“dCl. 

J  —  OO  J  —  oo 

Therefore, 

Pe  „H  =  cos(60)p(u)  +  sin  (90)H{p(u>)), 

where  7i(p(u))  is  the  so-called  Hilbert  transform  of  p{oo).  This  is  the  expression 
used  in  computing  the  profiles  in  Fig.  3(a).  A  simpler  expression  can  be  used  for 
the  case  of  small  0O  (cos(0o)  :=s  1  and  sin(0o)  0O ): 

Pe  oM  =  pH  +  90H(p{u)).  (Al) 


Another  way  of  changing  the  symmetry  of  the  peak  p(u>)  is  by  changing  its  SF 
by  some  6SF.  The  new  tilted  peak  is: 


Pssf(u)  =  b  + 


(u _ 


—  b  - 1-  CLr 


_ 

10 20  (1+») 


(ui  —  Wo) 


amax 

where  u>  >  u0  and  the  approximation  holds  for  6SF  C  1  +  SF.  To  simplify  the 
algebra,  let  the  peak  be  centered  at  u>0  =  0  with  6=1,  and  let  the  starting  peak  be 
symmetric  (SF  =  0)  with  slopes  S  (defined  in  Sec.  I  A).  Then: 


P*sfM  =  pH  +  amax  10™"  (10  ™  5SF  "  -  1) 

=  pH  +  (pH  -  1)  (e^1 5SF  "  ,nl°  -  1) 
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«  p{ u)  +  6SF  •  [(p( u)  -  1)  •  (\S\  u)  ■  In  10/20],  (A2) 

Comparing  the  Eqs.  (Al)  and  (A2),  it  is  evident  that  to  interpret  £SF  as  equal  to 
the  ripple  phase  shift  90,  we  must  have: 

H(p(u))  «  (p(o>)  -  1)  •  (\S\  u)  ■  In  10/20.  (A3) 

This  equivalence  is  demonstrated  in  Fig.  Al  where  the  two  expressions  are  plotted 
together.  The  correspondence  between  pssf(^)  and  pe0(u)  is  further  illustrated  in 
Figs.  3  (text). 


Figure  Al:  Comparing  the  two  sides  of  Eq.  (A3)  for  a  peak  with  SF=0,  BWF=0.2 
octave,  and  Amax  =  15  dB  amplitude  (or  b  +  amax  =  6.62).  The  solid  line  is  the 
Hilbert  transform  and  the  dashed  line  is  the  right-hand  side  of  the  Eq.  (A3). 
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Appendix  B  Dependence  on  peak  amplitude  and 

spectral  density 


1.  Detection  of  change  in  bandwidth 

(i)  Dependence  on  peak  amplitude 

The  dependence  of  6BWF/BWF  thresholds  on  peak  levels  was  examined  using 
10  dB,  15  dB  ,  and  20  dB  peak  amplitudes.  The  average  results  of  three  subjects  are 
shown  as  a  function  of  SF  and  then  as  a  function  of  BWF  in  Figs.  Bl(a)  and  (b), 
respectively.  (One  of  the  three  subjects  also  participated  in  6BWF  tests  reported  in 
Sec.  Ill  B.) 

The  plots  confirm  that,  at  a  particular  level,  the  6BWF /BWF  threshold  is  largely 
independent  of  peak  shape.  However,  thresholds  do  vary  as  a  function  of  peak  level 
but  mostly  at  lower  peak  levels.  For  instance,  on  average,  the  rate  of  threshold  rise 
in  going  from  20  dB  to  15  dB  peaks  is  less  than  half  of  that  seen  between  15  dB  and 
10  dB  peaks. 

(ii)  Spectral  density  dependence 

These  experiments  explored  threshold  dependence  on  the  spectral  density  of  the 
complex  while  keeping  total  base  bandwidth  constant  (0.2-5  kHz).  Peak  level  was 
kept  at  15  dB  level  and  11,  21,  and  41  component  complexes  were  used.  The  average 
results  of  three  subjects  are  presented  in  Figs.  B2  as  a  function  of  SF  in  (a)  and 
BWF  in  (b). 

The  6BWF/BWF  thresholds  remain  constant  for  almost  all  conditions  tested, 

1. e.,  regardless  of  peak  shape  and  spectral  density.  The  one  obvious  exception  is  at 
the  broadest  peak  for  the  11  component  case,  where  thresholds  increase  significantly. 

2.  Detection  of  change  in  symmetry 

(1)  Dependence  on  peak  amplitude 

These  experiments  tested  £SF  threshold  dependence  on  peak  amplitude  for  var¬ 
ious  testing  conditions  (Figs.  B3).  The  data  are  averaged  over  the  BWF’s  in 
Fig.  B3(a)  and  over  the  SF’s  in  Fig.  B3(b)  for  each  of  the  peak  levels. 

Two  conclusions  can  be  derived  from  these  data:  (1)  The  same  trends  described 
earlier  hold  regardless  of  peak  levels.  Thus,  except  for  the  narrowest  peak,  all  <*>SF 
thresholds  are  the  same  regardless  of  peak  shapes  studied.  Note  that  on  average,  the 
three  subjects  tested  here  exhibited  uniformly  higher  thresholds  than  the  earlier  five 
in  Sec.  Ill  C.  (One  of  the  subjects  participated  in  <5SF  tests  reported  in  Sec.  Ill  C.) 

(2)  6SF  thresholds  (as  a  function  of  BWF)  deteriorate  faster  at  the  narrowest  than 
other  peaks  with  decreasing  peak  level  (Fig.  B3(b)).  This  rise  is  largely  responsible 
for  the  upward  shift  in  the  mean  of  <5SF’s  (as  a  function  of  SF,  Fig.  3(a))  for  smaller 
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Figure  Bl:  Bandwidth  factor  change  detection  thresholds,  (5BWF/BWF,  for  41 
component  complex  were  measured  for  three  subjects  over  the  following  conditions: 
three  SF’s  (0,  0.2,  0.4),  three  BWF’s  (0.1,  0.2,  0.4  octaves),  and  at  three  peak  levels 
( Amax  =  10  dB,  15  dB,  20  dB).  The  dBWF/BWF  thresholds  are  shown  as  a  function 
of  SF  in  (a)  and  BWF  in  (b).  Points  are  offset  along  the  abscissa  for  clarity. 
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Figure  B2:  £BWF/BWF  thresholds  for  41,  21,  and  11  component  complexes  and 
15  dB  peak  level  are  averaged  over  three  subjects  and  three  BWF’s  (0.1,  0.2,  0.4 
octaves)  in  (a)  and  two  SF’s  (0,  0.4)  in  (b).  Threshold  is  independent  of  spectral 
density  for  all  but  the  broadest  BWF,  where  it  increases  for  the  11  component  case. 
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BWF 

Figure  B3:  Symmetry  factor  change  detection  thresholds,  <5SF,  for  41  component 
complex  and  three  peak  amplitudes  (10  dB,  15  dB,  20  dB).  At  each  peak  level  the 
data  are  averaged  over  three  subjects  and  three  BWF’s  (0.1,  0.2,  0.4  octaves)  in  (a) 
and  three  SF’s  (0,  0.2,  0.4)  in  (b).  The  large  error  bars  in  (a)  are  due  to  the  <5SF 
threshold  increase  for  the  narrowest  peak  (BWF=0.1  octave)  seen  in  (b).  Points  are 
slightly  offset  along  the  abscissa  for  clarity. 
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peak  levels.  The  overall  slight  rise  in  thresholds  may  reflect  the  masking  of  the  peak 
by  the  base,  which  presumably  increases  for  lower  peak  levels. 

(ii)  Spectral  density  dependence 

Once  again,  all  <$SF  values  and  trends  described  earlier  largely  hold  regardless  of 
spectral  densities  (Figs.  B4).  The  most  prominent  change  in  dSF  thresholds  occurs 
as  a  function  of  spectral  density  at  the  narrowest  peak  (Fig.  B4(b)).  The  threshold 
here  deteriorates  rapidly  as  the  spectral  density  decreases  and,  as  in  Fig.  B3(a),  it 
is  largely  this  accelerated  rise  that  is  responsible  for  the  upward  shifts  in  the  mean 
of  <*)SF  in  Fig.  B4(a). 
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Figure  B4:  <$SF  thresholds  for  41,  21,  and  11  component  complexes  and  15  dB  peak 
level  are  averaged  over  four  subjects  and  three  BWF’s  (0.1,  0.2,  0.4  octaves)  in  (a) 
and  three  SF’s  (0,  0.2,  0.4)  for  41  and  21  component  tests  and  two  SF’s  (0,  0.4)  for 
11  component  case  in  (b).  As  in  Fig.  B3,  the  large  error  bars  in  (a)  are  due  to  <5SF 
increase  for  the  narrowest  peak  seen  in  (b).  Note  that  in  (b),  most  £SF  changes 
with  spectral  density  occur  at  the  narrowest  peak. 
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